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Abstract — We show that there exists a universal quantum Tur- 
ing machine (UQTM) that can simulate every other QTM until 
the other QTM has halted and then halt itself with probability one. 
This extends work by Bernstein and Vazirani who have shown 
that there is a UQTM that can simulate every other QTM for 
an arbitrary, but preassigned number of time steps. 

As a corollary to this result, we give a rigorous proof that 
quantum Kolmogorov complexity as defined by Berthiaume et 
al. is invariant, i.e. depends on the choice of the UQTM only up 
to an additive constant. 

Our proof is based on a new mathematical framework for 
QTMs, including a thorough analysis of their halting behaviour. 
We introduce the notion of mutually orthogonal halting spaces 
and show that the information encoded in an input qubit string 
can always be effectively decomposed into a classical and a 
quantum part. 

Index Terms — Quantum Turing Machine, Kolmogorov Com- 
plexity, Universal Quantum Computer, Quantum Kolmogorov 
Complexity, Halting Problem. 



I. Introduction 

ONE of the fundamental breakthroughs of computer sci- 
ence was the insight that there is a single computing 
device, the universal Turing machine (TM), that can simu- 
late every other possible computing machine. This notion of 
universality laid the foundation of modern computer technol- 
ogy. Moreover, it provided the opportunity to study general 
properties of computation valid for every possible computing 
device at once, as in computational complexity and algorithmic 
information theory respectively. 

Due to the development of quantum information theory in 
recent years, much work has been done to generalize the 
concept of universal computation to the quantum realm. In 
1985, Deutsch [1] proposed the first model of a quantum 
Turing machine (QTM), elaborating on an even earlier idea 
by Feynman Q. Bernstein and Vazirani [3] worked out the 
theory in more detail and proved that there exists a QTM that 
is universal in the sense that it efficiently simulates every other 
possible QTM. This remarkable result provides the foundation 
to study quantum computational complexity, especially the 
complexity class BQP. 

In this paper, we shall show that there exists a QTM that is 
universal in the sense of program lengths. This is a different 
notion of universality, which is needed to study quantum 
algorithmic information theory. The basic difference is that 
the "strongly universal" QTM constructed in this paper does 
not need to know the number of time steps of the computation 
in advance, which is difficult to achieve in the quantum case. 

M. Muller is with the Institute of Mathematics, Technical University of 
Berlin (e-mail: mueller@math.tu-berlin.de). 



For a compact presentation of the results by Bernstein 
and Vazirani, see the book by Gruska H. Additional rele- 
vant literature includes Ozawa and Nishimura |5|, who gave 
necessary and sufficient conditions that a QTM's transition 
function results in unitary time evolution. Benioff [6] has 
worked out a slightly different definition which is based on 
a local Hamiltonian instead of a local transition amplitude. 

A. Quantum Turing Machines and their Halting Conditions 

Our discussion will rely on the definition by Bernstein and 
Vazirani. We describe their model in detail in Subsection lH-BI 
Similarly to a classical TlvQ, a QTM consists of an infinite 
tape, a control, and a single tape head that moves along the 
tape cells. The QTM as a whole evolves unitarily in discrete 
time steps. The (global) unitary time evolution U is completely 
determined by a local transition amplitude 5 which only affects 
the single tape cell where the head is pointing to. 

There has been a vivid discussion in the literature on the 
question when we can consider a QTM as having halted on 
some input and how this is compatible with unitary time 
evolution, see e.g. Q, @, 0, Q3, (TTJ- We will not get 
too deep into this discussion, but rather analyze in detail the 
simple definition for halting by Bernstein and Vazirani Q, 
which we also use in this paper. We argue below that this 
definition is useful and natural, at least for the purpose to 
study quantum Kolmogorov complexity. 

Suppose a QTM M runs on some quantum input of 
n qubits for t time steps. The control C of M will then be 
in some state (obtained by partial trace over the all the other 
parts of the QTM) which we denote M^(\ip)). In general, this 
is some mixed state on the finite-dimensional Hilbert space 
Tic that describes the control. By definition of a QTM (see 
Subsection III-Bt . there is a specified final state \qt) £ Tic- 
According to [3|, we say that the QTM M halts at time T on 
input if 

(«/|Afgr»)| g/ ) = 1 and {q^MUmhf) =0 V< < T. 

We can rephrase this definition as M^Qip)) — |<?/)(<7/|, 
i.e. the control is exactly in the final state at time T, and 
supp (Mq(\iP))) _L \qf), i.e. the control state is exactly 
orthogonal to the halting state at any time t < T before the 
halting time. 

In general, the overlap of MqQi/))) with the final state \qf) 
will be some arbitrary number between zero and one. Hence, 
for most input qubit strings there will be no time T G N 

We use the terms "Turing machine" (TM) and "computer" synonymously 
for "partial recursive function from {0,1}* to {0,1}*", where {0,1}* = 
{A,0,1,00,...} denotes the finite binary strings. 
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such that the aforementioned halting conditions are satisfied. 
We call those qubit strings non-halting, and otherwise T- 
halting, where T S N is the corresponding halting time. 

In Subsection IIII-A1 we analyze the resulting geometric 
structure of the halting input qubit strings. We show that 
inputs with some fixed length n that make the QTM M 
halt after t steps form a linear subspace Hf} (i). Moreover, 
inputs with different halting times are mutually orthogonal, 
i.e. H^{t) _L H { $ '(*') if t ^ t'. According to the halting 
conditions given above, this is almost obvious: Superpositions 
of i-halting inputs are again /-halting, and inputs with different 
halting times can be perfectly distinguished, just by observing 
their halting time. 

In Figure [T] a geometrical picture of the halting space 
structure is shown: The whole space M. 3 represents the space 
of inputs of some fixed length n, while the plane and the 
straight line represent two different halting spaces 7i^(t') 
and 7i]^(t). Every vector within these subspaces is perfectly 
halting, while every vector "in between" is non-halting and 
not considered a useful input for the QTM M. 




>| l-halting 



Fig. 1. mutually orthogonal halting spaces. 

At first, it seems that the halting conditions given above are 
far too restrictive. Don't we loose a lot by dismissing every 
input which does not satisfy those conditions perfectly, but, 
say, only approximately up to some small e? To see that it is 
not that bad, note that 

• most (if not all) of the well-known quantum algorithms, 
like the quantum Fourier transform or Shor's algorithm, 
have classically controlled halting. That is, the halting 
time is known in advance, and can be controlled by a 
classical subprogram. 

• we show elsewehere iPTZl (cf. Theorem 13. 15b that every 
input that is almost halting can be modified by adding 
at most a constant number of qubits to halt perfectly, i.e. 
to satisfy the aforementioned halting conditions. This can 
be interpreted as some kind of "stability result", showing 
that the halting conditions are not "unphysical", but have 
some kind of built-in error tolerance that was not expected 
from the beginning. 

Moreover, this definition of halting is very useful. Given 
two QTMs Mi and M 2 , it enables us to construct a QTM M 
which carries out the computations of M\, followed by the 
computations of M 2 , just by redirecting the final state \qj) of 
Mi to the starting state |qo) of M 2 (see [3, Dovetailing Lemma 
4.2.6]). In addition, it follows from this definition that QTMs 



are quantum operations, which is a very useful and plausible 
property. 

Even more important, at each single time step, an out- 
side observer can make a measurement of the control state, 
described by the operators !<?/)(<?/ 1 and 1 — \qf)(qf\ (thus 
observing the halting time), without spoiling the computation, 
as long as the input \tp) is halting. As soon as halting is 
detected, the observer can extract the output quantum state 
from the output track (tape) and use it for further quantum 
information processing. This is true even if the halting time 
is very large, which typically happens in the study of Kol- 
mogorov complexity. Consequently, our definition of halting 
has the useful property that if an outside observer is given 
some unknown quantum state which is halting, then the 
observer can find out with certainty by measurement. 

Finally, if we instead introduced some probabilistic notion 
of halting (say, we demanded that we observe halting of 
the QTM M at some time t with some large probability 
p < 1), then it would not be so clear how to define 
quantum Kolmogorov complexity correctly. Namely if the 
halting probability is much less than one, it seems necessary 
to introduce some kind of "penalty term" into the definition 
of quantum Kolmogorov complexity: there should be some 
trade-off between program length and halting accuracy, and 
it is not so clear what the correct trade-off should be. For 
example, what is the complexity of a qubit string that has a 
program of length 100 which halts with probability 0.8, and 
another program of length 120 which halts with probability 

0. 9? The definition of halting that we use in this paper avoids 
such questions. 

B. Different Notions of Universality for QTMs 

Bernstein and Vazirani [3] have shown that there exists 
a universal QTM (UQTM) U. It is important to understand 
what exactly they mean by "universal". According to [3 Thm. 
7.0.2], this UQTM U has the property that for every QTM M 
there is some classical bit string sm G {0, 1}* (containing a 
description of the QTM M) such that 

\\U(s m ,T,6,\tJj))-K(M^)))\\ Ti <S (1) 

for every input accuracy 5 > and number of time steps 
Tel Here, || - 1 1 Tr is the trace distance, and K (MgQijj))) 
is the content of the output tape O of M after T steps of 
computation (the notation will be defined exactly in Subsec- 
tion llTBl . 

This means that the UQTM U simulates every other QTM 
M within any desired accuracy and outputs an approximation 
of the output track content of M and halts, as long as the 
number of time steps T is given as input in advance. 

Since the purpose of Bernstein and Vazirani's work was 
to study the computational complexity of QTMs, it was a 
reasonable assumption that the halting time T is known in 
advance (and not too large) and can be specified as additional 
input. The most important point for them was not to have short 
inputs, but to prove that the simulation of M by U is efficient, 

1. e. has only polynomial slowdown. 

The situation is different if one is interested in studying 
quantum algorithmic information theory instead. It will be 
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explained in Subsection II-CI below that the universality notion 
(Q]l is not enough for proving the important invariance property 
of quantum Kolmogorov complexity, which says that quantum 
Kolmogorov complexity depends on the choice of the universal 
QTM only up to an additive constant. 

To prove the invariance property, one needs a generalization 
of (Q~|), where the requirement to have the running time T as 
additional input is dropped. We show below in Section [Til] 
that there exists a UQTM H that satisfies such a generalized 
universality property, i.e. that simulates every other QTM until 
that other QTM has halted, without knowing that halting time 
in advance, and then halts itself. 

Why is that so difficult to prove? At first, it seems that one 
can just program the UQTM U mentioned in (Q]) to simulate 
the other QTM M for T — 1,2,3,... time steps, and, after 
every time step, to check if the simulation of M has halted 
or not. If it has halted, then U halts itself and prints out the 
output of M, otherwise it continues. 

This approach works for classical TMs, but for QTMs, there 
is one problem: in general, the UQTM U can simulate M 
only approximately. The reason is the same as for the circuit 
model, i.e. the set of basic unitary transformations that U can 
apply on its tape may be algebraically independent from that 
of M, making a perfect simulation in principle impossible. 
But if the simulation is only approximate, then the control 
state of M will also be simulated only approximately, which 
will force U to halt only approximately. Thus, the restrictive 
halting conditions given above in Equation (O will inevitably 
be violated, and the computation of U will be treated as invalid 
and be dismissed by definition. 

This is a severe problem that cannot be circumvented easily. 
Many ideas for simple solutions must fail, for example the idea 
to let U compute an upper bound on the halting time T of all 
inputs for M of some length n and just to proceed for T 
time steps: upper bounds on halting times are not computable. 
Another idea is that the computation of U should somehow 
consist of a classical part that controls the computation and 
a quantum part that does the unitary transformations on the 
data. But this idea is difficult to formalize. Even for classical 
TMs, there is no general way to split the computation into 
"program" and "data" except for special cases, and for QTMs, 
by definition, global unitary time evolution can entangle every 
part of a QTM with every other part. 

Our proof idea rests instead on the observation that every 
input for a QTM which is halting can be decomposed into a 
classical and a quantum part, which is related to the mutual 
orthogonality of the halting spaces. See Subsection II-EI for 
details. 

C. Q-Kolmogorov Complexity and its Supposed Invariance 

The classical Kolmogorov complexity Cjj(s) of a finite bit 
string s £ {0, 1}* is defined as the minimal length of any 
computer program p that, given as input into a TM M, outputs 
the string and makes M halt: 

C M (s) :=mxn{l(p) \ M(p) = s}. 

For this quantity, running times are not important; all that 
matters is the input length. There is a crucial result that is the 



basis for the whole theory of Kolmogorov complexity (see 
|[T3lD . Basically, it states that the choice of the computer M is 
not important as long as M is universal; choosing a different 
universal computer will alter the complexity only up to some 
additive constant. More specifically, there exists a universal 
computer U such that for every computer M there is a constant 
cm S N such that 

Cu{s)<Cm(s)+c m for every s£ {0,1}*. (2) 

This so-called "invariance property" follows easily from the 
existence of a universal computer U in the following sense: 
There exists a computer U such that for every computer M 
and every input s £ {0, 1}* there is an input s £ {0, 1}* such 
that U(s) = M (s) and t{s) < £(s) + c M , where c M £ N is a 
constant depending only on M. In short, there is a computer 
U that produces every output that is produced by any other 
computer, while the length of the corresponding input blows 
up only by a constant summand. One can think of the bit string 
s as consisting of the original bit string s and of a description 
of the computer M (of length cm). 

The quantum generalization of Kolmogorov complexity that 
we consider in this paper has been first defined by Berthiaume, 
van Dam and Laplante 03). Basically, they define the quantum 
Kolmogorov complexity QC of a string of qubits \ip) as the 
length of the shortest string of qubits that, when given as input 
to a QTM M, makes M output and halt. (We give a 
formal definition of a "qubit string" in Subsection III- Al and of 
quantum Kolmogorov complexity QC in Subsection III-O . 

In fl4l . it is claimed that quantum Kolmogorov complexity 
QC is invariant up to an additive constant similar to (f2]). It is 
stated there that the existence of a universal QTM U in the 
sense of Bernstein and Vazirani (see Equation (Q~|)) makes it 
possible to mimic the classical proof and to conclude that the 
UQTM U outputs all that every other QTM outputs, implying 
invariance of quantum Kolmogorov complexity. 

But this conclusion cannot be drawn so easily, because (Q]) 
demands that the halting time T is specified as additional 
input, which can enlarge the input length dramatically, if 
T is very large (which typically happens in the study of 
Kolmogorov complexity). 

As explained above in Subsection II-BI it is not so easy to 
get rid of the halting time. The main reason is that the UQTM 
U can simulate other QTMs only approximately. Thus, it will 
also simulate the control state and the signaling of halting only 
approximately, and cannot just "halt whenever the simulation 
has halted", because then, it will violate the restrictive halting 
conditions given in Equation ©. As we have chosen this 
definition of halting for good reasons (cf. the discussion at 
the beginning of Subsection II-AI above), we do not want to 
drop it. 

Instead of (U), a stronger notion of universality is needed, 
namely a "strongly universal" QTM il that, as explained 
above in Subsection II-BI simulates every other QTM M 
until the other QTM has halted and then halts itself with 
probability one, as required by the halting conditions given 
in Subsection II-AI Then, the classical proof outlined above 
can be carried over to the quantum situation. In this paper, we 
prove that such a QTM il really exists (Theorem ll.il ). and as 
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a corollary, the invariance property for quantum Kolmogorov 
complexity follows (Theorem 11. 2| >. 

D. Main Theorems 

One main result of this paper is the existence of a "strongly 
universal" QTM that simulates every other QTM until the 
other QTM has halted and then halts itself. Note that the 
halting state is attained by il exactly (with probability one) 
in accordance with the strict halting conditions stated in 
Equation (O. The exact definition of "qubit strings" and the 
output M(a) of M on input a is given below in Section iHl 

Theorem 1.1 (Strongly Universal Q-Turing Machine): 
There is a fixed-length quantum Turing machine il such that 
for every QTM M and every qubit string a for which M (a) 
is defined, there is a qubit string <jm such that 

\\il{a M ,S)-M(a)\\ Tr <6 

for every 6 6 Q + , where the length of <tm is bounded by 
< 1(a) + cm, and cm € N is a constant depending 
only on M. 

Note that um does not depend on S. We conclude from 
this theorem and a two-parameter generalization given in 
Proposition 13.141 that quantum Kolmogorov complexity as 
defined in Ifl4l is indeed invariant, i.e. depends on the choice 
of the strongly universal QTM only up to some constant: 

Theorem 1.2 (Invariance of Q-Kolmogorov Complexity): 
There is a fixed-length quantum Turing machine il such that 
for every QTM M there is a constant cm € N such that 

QCu(p) < QC M (p) + c M for every qubit string p. 

Moreover, for every QTM M and every S, A £ Q + with 5 < 
A, there is a constant cm,s,a S N such that 

Q^'u(p) ^ QCm(p) + c m,<5,a for every qubit string p. 

All the proofs are given in Section [TTTJ while the ideas of the 
proofs are outlined in the next subsection. 

E. Ideas of Proof 

The proof of Theorem 11.11 relies on the observation about 
the mutual orthogonality of the halting spaces, as explained 
in Subsection II- Al Fix some QTM M, and denote the set of 
vectors \tp) 6 (C 2 ) which cause M to halt at time t by 
H$(t). If \ip) € (C 2 )®" is any halting input for M, then 
we can decompose \ip) in some sense into a classical and a 
quantum part. Namely, the information contained in \ip) can 
be split into a 

• classical part: The vector \ip) is an element of which of 
the subspaces 7il9(t)l 

• quantum part: Given the halting time t of \<p), then where 
in the corresponding subspace H^{t) is \ip) situated? 

Our goal is to find a QTM il and an encoding \(p) G 
(C 2 )® ( ™ +1 ' ) of \ip) which is only one qubit longer and which 
makes the (cleverly programmed) QTM il output a good 
approximation of M(\ ip)). First, we extract the quantum part 
out of \<p). While dim(C 2 ) = 2™, the halting space 



H^m ( T ) tnat contains \tp) is only a subspace and might have 
much smaller dimension d < 2™. This means that we need less 
than n qubits to describe the state \<p)\ indeed, [log 2 d~\ qubits 
are sufficient. In other words, there is some kind of "standard 
compression map" C that maps every vector \tp) S Wj^(r) 



into the |~log 2 d~\ -qubit-space 



i [log 2 d] 



Thus, the qubit 



string C\cp) of length [~log 2 ef| < n can be considered as the 
"quantum part" of \ip). 

So how can the classical part of \<p) be encoded into a 
short classical binary string? Our task is to specify what 
halting space (r) corresponds to \(p). Unfortunately, it 
is not possible to encode the halting time r directly, since r 
might be huge and may not have a short description. Instead, 
we can encode the halting number. Define the halting time 
sequence {ti}fL 1 as the set of all integers f £ N such that 
diuiH^j 1 (t) > 1, ordered such that < U+\ for every i, 
that is, the set of all halting times that can occur on inputs of 
length n. Thus, there must be some i £ N such that r = £j, 
and i can be called the halting number of \ip). Now, we assign 
code words Cj to the halting numbers i, that is, we construct 
a prefix code {c{\f =1 C {0, 1}*. We want the code words to 
be short; we claim that we can always choose the lengths as 

t{ Ci ) = n + 1 - [log 2 dimH^iU)] . 

This can be verified by checking the Kraft inequality: 

N N 



i2 dimH^iU)-]-! 



< 2-™^dimW^ ) (i l ) < 2-"dim(i 



^ 2 \ (gin 



< 1, 

since the halting spaces are mutually orthogonal. 

Putting classical and quantum part of \tp) together, we get 

\<p) := a ®C\tp) , 

where i is the halting number of Thus, the length of \tp) 
is exactly n + 1. 

Let sm be a self-delimiting description of the QTM M. 
The idea is to construct a QTM il that, on input sm <8> \(p), 
proceeds as follows: 

• By classical simulation of M, it computes descriptions 
of the halting spaces H^ } (1)> ^'(2), W^(3), ... and 
the corresponding code words c\, c 2 , C3, . . . one after the 
other, until at step t, it finds the code word q that equals 
the code word in the input. 

• Afterwards, it applies a (quantum) decompression map to 
approximately reconstruct \<p) from C\ tp). 

• Finally, it simulates (quantum) for r time steps the time 
evolution of M on input \ip) and then halts, whatever 
happens with the simulation. 

Such a QTM il will have the strong universality property as 
stated in Theorem 11.11 Unfortunately, there are many difficul- 
ties that have to be overcome by the proof in Section [HI] 

• Also classically, QTMs can only be simulated approxi- 
mately. Thus, it is for example impossible for il to decide 
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by classical simulation whether the QTM M halts on 
some input \tp) perfectly or only approximately at some 
time t. Thus, we have to define certain ^-approximate 
halting spaces H.^'^ (t) and prove a lot of lemmas with 
nasty inequalities. 

• Since our approach includes mixed qubit strings, we have 
to consider mixed inputs and outputs as well. 

• The aforementioned prefix code must have the property 
that one code word can be constructed after the other 
(since the sequence of all halting times is not com- 
putable), see Lemma [3. 121 

We show that all these difficulties (and some more) can 
be overcome, and the idea outlined above can be converted 
to a formal proof of Theorem 11.11 and the second part of 
Theorem 11.21 which we give in full detail in Section Hill 

For the first part of Theorem ll.21 concerning the complexity 
notion QC, a more general result is needed which is stated 
in Proposition 13.141 since this complexity notion needs an 
additional parameter as input. For this proposition, the proof 
idea outlined above needs to be modified. The idea for the 
modified proof of that proposition is to make the QTM it 
determine the halting number of the input (and thus the 
halting time) directly by projective measurement in the basis 
of (approximations of) the halting spaces. We will not prove 
Proposition 13 . 1 41 in full detail, but only sketch the proof there, 
since the technical details are similar to that of the proof of 
Theorem 11.11 

II. Mathematical Framework and Formalism 

Here, we introduce the formalism that is used in Section ITTT1 
to describe qubit strings, quantum Turing machines, and quan- 
tum Kolmogorov complexity. We denote the density operators 
on a Hilbert space H by T 1 ^(H) (i.e. the positive trace-class 
operators with trace 1). The natural numbers will be denoted 
N = {1, 2, 3, . . . , }, and we use the symbols N :=NU {0} 
and Kq" := {x S K | x > 0} as well as 5t>t, which shall be 1 
if tf = t and otherwise. 

A. Indeterminate-Length Qubit Strings 

The quantum analogue of a bit string, a so-called qubit 
string, is a superposition of several classical bit strings. To 
be as general as possible, we would like to allow also 
superpositions of strings of different lengths like 

\<p) := -i= (|00) + |11011». (3) 

Such quantum states are called indeterminate-length qubit 
strings. They have been studied by Schumacher and West- 
moreland [15 1, as well as by Bostrom and Felbinger |[T6l in 
the context of lossless quantum data compression. 

Let H n := (C^ ' 1 })®™ be the Hilbert space of n qubits 
(n £ No). We write C {0 ' 1} for C 2 to indicate that we fix 
two orthonormal computational basis vectors |0) and |1). The 
Hilbert space 7i{o,i}* which contains indeterminate-length 
qubit strings like \ip) can be formally defined as the direct 
sum 

oo 

^{0,1}* : = (J) ^fc' 
k=0 



The classical finite binary strings {0, 1}* are identified with 
the computational basis vectors in W{o,i}*, i-e. 7i{o,i}* — 
£ 2 ({A, 0, 1, 00, 01, . . .}), where A denotes the empty string. We 
also use the notation H.< n '■= ®/J = o^ fe anc ^ treat ^ as a 
subspace of 7i{o,i}*- 

To be as general as possible, we do not only allow super- 
positions of strings of different lengths, but also mixtures, i.e. 
our qubit strings are arbitrary density operators on 7i{o.i}*- 
It will become clear in the next sections that QTMs naturally 
produce mixed qubit strings as outputs. Moreover, it will be 
a useful feature that the result of applying the partial trace to 
segments of qubit strings will itself be a qubit string. 

Definition 2.1 (Qubit Strings and their Length): 
An (indeterminate-length) qubit string a is a density operator 
on 7i{o,i}*- Normalized vectors \ip) £ W{o,i}* will also be 
called qubit strings, identifying them with the corresponding 
density operator |i/))('0|. The base length (or just length) of a 
qubit string a € T L + (7i{ ,i}*) is defined as 

i{a) := mas{f(s) | (s\a\s) > 0, s e {0, 1}*} 

or as 1(a) — oo if the maximum does not exist. 
For example, the density operator with \ip) as defined 

in Equation (0) is a (pure) qubit string of length £(\ip)(ip\) = 
5. This corresponds to the fact that this state \<p) needs at 
least 5 cells on a QTM's tape to be stored perfectly (compare 
Subsection lH-BI ). An alternative approach would be to consider 
the expectation value I of the length instead, which has been 
proposed by Rogers and Vedral [17|, see also the discussion 
in Section [TV] 

In contrast to classical bit strings, there are uncountably 
many qubit strings that cannot be perfectly distinguished by 
means of any quantum measurement. A good measure for 
the difference between two qubit strings a and p is the trace 
distance (cf. ifLSll ) 

l|p-Hk:=^|p-^ = ^EW' (4) 

i 

where the A; are the eigenvalues of the trace-class operator 
p— a. Its operational interpretation is that it gives the maximum 
probability of correctly distinguishing between p and a by 
means of any single quantum measurement. 

B. Mathematical Description of Quantum Turing Machines 

Bernstein and Vazirani ([3|, Def. 3.2.2) define a quantum 
Turing machine M as a triplet (£, Q, 5), where £ is a finite 
alphabet with an identified blank symbol and Q is a finite 
set of states with an identified initial state qo and final state 
q f ^ q Q . The function 5 : Q x £ cSxQx{L,/i} ifj called 
the quantum transition function. The symbol C denotes the set 
of complex numbers aeC such that there is a deterministic 
algorithm that computes the real and imaginary parts of a to 
within 2~™ in time polynomial in n. 

One can think of a QTM as consisting of a two-way 
infinite tape T of cells indexed by Z, a control C, and a 
single "read/write" head H that moves along the tape. A 
QTM evolves in discrete, integer time steps, where at every 
step, only a finite number of tape cells is non-blank. For 
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every QTM, there is a corresponding Hilbert space Hqtm = 
He <8 Ht ® Hh, where He = is a finite-dimensional 
Hilbert space spanned by the (orthonormal) control states 
q E Q, while Ht = £ 2 (T) and Hu = £ 2 (1) are separable 
Hilbert spaces describing the contents of the tape and the 
position of the head, where 

T = {(xi) i& G S z | Xl ^ # for finitely many i G Z} (5) 

denotes the set of classical tape configurations with finitely 
many non-blank symbols. 

For our purpose, it is useful to consider a special class of 
QTMs with the property that their tape T consists of two 
different tracks (cf. |3 Def. 3.5.5]), an input track I and an 
output track O. This can be achieved by having an alphabet 
which is a Cartesian product of two alphabets, in our case 
S = {0, 1, #} x {0, 1, #}. Then, the tape Hilbert space Ht 
can be written as Ht =Hi® Ho- 

The transition function 8 generates a linear operator Um 
on Hqtm describing the time evolution of the QTM M. If 
8 is chosen in accordance with certain conditions, then Um 
will be unitary (and thus compatible with quantum theory), 
see Ozawa and Nishimura 0. We identify a G 7i + (W{o.i}*) 
with the initial state of M on input a, which is according to 
the definition in a state on Hqtm where a is written on 
the input track over the cell interval [0,£(<r) — 1], the empty 
state # is written on the remaining cells of the input track and 
on the whole output track, the control is in the initial state qo 
and the head is in position 0. By linearity, this e.g. means that 
the vector |^) = -4= (|0) + |H)) is identified with the vector 
"^2 + 1 1-0) on m P ut track cells number and 1. 

The global state M*(cr) E T+(Hqtm) of M on input a at 
time t G No is given by M*(cr) = (UmY & (Um)* ■ The state 
of the control at time t is thus given by partial trace over all the 
other parts of the machine, that is M c (cr) := Ttt.h (M t (a)) 
(similarly for the other parts of the QTM). In accordance with 
Def. 3.5.1], we say that the QTM M halts at time t EN 
on input a S (W{o,u*)> if and only if 

(q f \M c (a)\q f ) = 1 and (q f \M&(<r)\q f ) =0 Vf < i, (6) 

where g/ e Q is the final state of the control (specified in 
the definition of M) signalling the halting of the computation. 
See Subsection II-AI for a detailed discussion of these halting 
conditions (5). 

In this paper, when we talk about a QTM, we do not mean 
the machine model itself, but rather refer to the corresponding 
partial function on the qubit strings which is computed by the 
QTM. Note that this point of view is different from e.g. that 
of Ozawa [9] who describes a QTM as a map from S* to the 
set of probability distributions on £*. 

We still have to define what is meant by the output of a 
QTM M, once it has halted at some time t on some input qubit 
string a. We could take the state of the output tape M Q (a) 
to be the output, but this is not a qubit string, but instead a 
density operator on the Hilbert space Ho- Hence, we define 
a quantum operation TZ which maps the density operators on 
Ho to density operators on 7i{o,i}*, i.e. to the qubit strings. 
The operation TZ "reads" the output from the tape. 



Definition 2.2 (Reading Operation): 
A quantum operation TZ : T(Ho) — * ^"(^{o,i}*) i s called a 
reading operation, if for every finite set of classical strings 
{ s i\iLi C {0, 1}*, it holds that 



N 



n p 



# # 8i # # 

-2 -1 t(8i) + 1 



f N 



where P(|<^)) := I^K^I denotes the projector onto \ip). 

The condition specified above does not determine 1Z 
uniquely; there are many different reading operations. For the 
remainder of this paper, we fix the reading operation 1Z which 
is specified in the following example. 

Example 2.3: Let T denote the classical output track con- 
figurations as defined in Equation (0, with S = {0,1, 
Then, for every t G T, let R(t) be the classical string that 
consists of the bits of T from cell number zero to the last 
non-blank cell, i.e. 



R 



T — > {0, 1}* 



? ? 



# 



t(sj ((») + 1 



For every s G {0, 1}*, there is a countably-infinite number of 
t E T such that R(t) = s. Thus, to every t E T, we can assign 
a natural number n(t) which is the number of t in some enu- 
meration of the set {t' G T | R(t') = R{t)}; we only demand 
■■##*# # .. 



that n(t) = 1 if t = 
Hence, if (as usual) £ 2 



£ 2 (N) denotes the Hilbert space 



of square-summable sequences, then the map U, defined by 
linear extension of 



U:Ho 
\t) i 

is unitary. Then, the quantum operation 



H {0M *®£ 2 
\R{t))®\n{t)), 



TZ : T{Ho) 



P 



T(H{oa}*) 
Tip (UpU* 



is a reading operation. 

We are now ready to define QTMs as partial maps on the 
qubit strings. 

Definition 2.4 (Quantum Turing Machine (QTM)): 
A partial map M : T+(H {aA y) -> T^(H{ ,i}*) wil1 be 
called a QTM, if there is a Bernstein- Vazirani two-track QTM 
M 1 = (E,Q,8) (see 0, Def. 3.5.5) with the following 
properties: 

. £ = {0,1,#} x {0,1, #}, 

• the corresponding time evolution operator Um 1 is unitary, 

• if M' halts on input a at some time t G N, then M(a) = 
TZ (M' (a )j, w here TZ is the reading operation specified 
in Example 12.31 above. Otherwise, M (a) is undefined. 

A fixed-length QTM is the restriction of a QTM to the domain 
LLeNo ^(K) of length eigenstates. 
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The definition of halting, given by Equation ((6), is very 
important, as explained in Subsection II- Al On the other hand, 
changing certain details in a QTM's definition, like the way 
to read the output or allowing a QTM's head to stay at its 
position instead of turning left or right, should not change the 
results in this paper. 

C. Quantum Kolmogorov Complexity 

Quantum Kolmogorov complexity has first been defined by 
Berthiaume, van Dam, and Laplante fl4l . They define the 
complexity QC(p) of a qubit string p as the length of the 
shortest qubit string that, given as input into a QTM M, 
makes M output p and halt. Since there are uncountably many 
qubit strings, but a QTM can only apply a countable number 
of transformations (analogously to the circuit model), it is 
necessary to introduce a certain error tolerance 8 > 0. 

This can be done in essentially two ways: First, one can 
just fix some tolerance S. Second, one can demand that the 
QTM outputs the qubit string p as accurately as one wants, 
by supplying the machine with a second parameter as input 
that represents the desired accuracy. This is analogous to a 
classical computer program that computes the number n = 
3.14. . .: A second parameter k S N can make the program 
output 7r to k digits of accuracy, for example. We consider both 
approaches and follow the lines of [14] except for two simple 
modifications: we use the trace distance rather than the fidelity, 
and we also allow indeterminate-length and mixed input and 
output qubit strings. 

Definition 2.5 (Quantum Kolmogorov Complexity): Let M 
be a QTM and p G T L + (7i{o,i}*) a qubit string. For every 
5 > 0, we define the finite-error quantum Kolmogorov com- 
plexity QCfj(p) as the minimal length of any qubit string 
a- G T L + (7Y{o,i}*) sucn that the corresponding output M(a) 
has trace distance from p smaller than S, 

QC s M (p) := min{%) | \\p - M(a)\\ Tr < 5}. 

Similarly, we define the approximation-scheme quantum Kol- 
mogorov complexity QCm(p) as the minimal length of any 
qubit string a G / ^i + (W{o,i}*) su ch that when given M as 
input together with any integer k, the output M(a, k) has trace 
distance from p smaller than 1/k: 



QCm(p) :=mm\e(a) 



\\p- M(a,k)\\ Tl < yVkeN 
k 

For the definition of QCm, we have to fix a map to encode 
two inputs (a qubit string and an integer) into one qubit string; 
this is easy, see e.g. [13] for the classical case and |[T9l for the 
quantum case. Also, using f(k) := 1/k as accuracy required 
on input k is not important; any other computable and strictly 
decreasing function / that tends to zero for k — > oo such that 
/ _1 is also computable will give the same result up to an 
additive constant. 

Note that if M is at least able to move input data to the 
output track, then it holds QC M (p) < £(p) + cm with some 
constant cm G N (and similarly for QCm)- In [19], we have 
shown that for ergodic quantum information sources, emitted 
states |V>) G (C 2 )®" have a complexity rate ^QC^(\tp)) that 



is with asymptotic probability 1 arbitrarily close to the von 
Neumann entropy rate s of the source. This demonstrates that 
quantum Kolmogorov complexity is a useful notion, and that 
it is feasible to prove interesting theorems on it. 

While this complexity notion QC(p) counts the length 
of the shortest qubit string that makes a QTM output p 
and halt, there have been different definitions for quantum 
algorithmic complexity by Vitanyi ||20| and Gacs ETI . Their 
approaches are based on classical descriptions and universal 
density matrices respectively and are not considered in this 
paper since they do not have the invariance problem outlined 
in Subsection II-CI 

Note also that Definition 12.51 depends on the definition of 
the length £(er) of a qubit string a G T L + (7i{o.i}* ); there is 
a different approach by Rogers and Vedral ifTTl that uses the 
expected (average) length I instead and results in a different 
notion of quantum Kolmogorov complexity. The results of this 
paper are applicable to that definition, too, as long as the notion 
of halting of the corresponding quantum computer is defined 
in a deterministic way as in Equation (|6]). 

III. Construction of a Strongly Universal QTM 
A. Halting Subspaces and their Orthogonality 

As already explained in Subsection II- Al in the introduction, 
restricting to pure input qubit strings \ip) G H n of some 
fixed length £(\ip)) = n, the vectors with equal halting time t 
form a linear subspace of Ti n . Moreover, inputs with different 
halting times are mutually orthogonal, as depicted in Figure Q] 
We will now use the formalism for QTMs introduced in 
Subsection IH-BI to give a formal proof of these statements. 
We use the subscripts C, I, O and H to indicate to what part 
of the tensor product Hilbert space a vector belongs. 

Definition 3.1 (Halting Qubit Strings): 
Let cr G ^i + (7^{o,i}*) be a qubit string and M a quantum 
Turing machine. Then, a is called t- halting (for M), if M 
halts on input a at time t G N. We define the halting sets and 
halting subspaces 



H M {t) 

n M (t) 

H ( M ] {t) 



= e H {0 ,i}* I \i>){^\ is ^-halting for M}, 

= {a|V> I IV) e%((),a6l}, 



Hm {t) n H T , 



n 



(«), 



H M {t)nH„ 



Note that the only difference between H^' (t) an d (t) is 
that the latter set contains non-normalized vectors. It will be 
shown below that TL ^ (t) is indeed a linear subspace. 

Theorem 3.2 (Halting Subspaces): 
For every QTM M, n G N and t G N, the sets H M {t) and 
Ti^j (t) are linear subspaces of 7Y{o,i}« resp. H n , and 



/(«)/ 



and H M (t) ±H M (t / ) ift^t'. 



Proof. Let \ip), G HM(t). The property that \ip) is ^-halting 
is equivalent to the statement that there are states |$* ) G 
Til ® TLo <8 Tin and coefficients c* G C for every t' < t and 
q G Q such that 

Vft(|¥>)i®|*o» 
^(|v)i®l*o» 



E4W 



(7) 

W < t, (8) 
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where Vm is the unitary time evolution operator for the QTM 
M as a whole, and |\&o) = |<Zo)c ® l#)o <$ |0)h denotes the 
initial state of the control, output track and head. Note that 
l^o) does not depend on the input qubit string (in this case 
!¥>»• 

An analogous equation holds for since it is also t- 
halting by assumption. Consider a normalized superposition 

a\(p) + 0\i/>) G H {0 ,iy: 



VJr ( («lv>i + /?!^>i) ® !*o) 



•0V&Mi®|*o) 



1 <ifi 



•P\lf)c 



9/ ' 



= "k/)c < 

- |«/)c® (a|*' w > 

Thus, the superposition also satisfies condition @, and, by a 
similar calculation, condition (HJ. It follows that + f3\ip) 
must also be i-halting. Hence, H.M{t) is a linear subspace 
of H{o,i}*- As me intersection of linear subspaces is again a 
linear subspace, so must be W$/(f). 

Let now \<p) G H M {t) and |<0) G H M (t') such that £ < *'. 
Again by Equations (|7]i and (JHJ, it holds 



t (^| ® (* |) (y&) 
c(?/k)c 



E 



|$*) = . 



It follows that 7Ym(0 
%(•)■ 



and similarly for H^(-) C 
□ 



The physical interpretation of the preceding theorem is 
straightforward: By linearity of the time evolution, superpo- 
sitions of i-halting strings are again i-halting, and strings 
with different halting times can be perfectly distinguished by 
observing their halting time. 

B. Approximate Halting Spaces 

The aim of this subsection is to show that the halting spaces 
of a QTM can be numerically approximated by a classical 
algorithm. Thus, we give a step by step construction of such 
an algorithm, and show analytically that the approximations it 
computes are good enough for our purpose. The main result 
is given in Theorem 13.41 Before we state that theorem, we fix 
some notation. 

Definition 3.3 (e-t-halting Property): A qubit string a G 
TJ + (7i{ ,i}* ) w iH t> e called e-t-halting for M for some tGN, 
e > and M a QTM, if and only if 



< e for t' < * , 
>l-e for *' = * . 



We denote by S n := {\ip) G H n | |||^)||=1} the 
unit sphere in TL n = (C 2 )®", and by UsQip)) '■= 
{|V>)GH„ I \\\tp) - \(p)\\ < 5} an open ball. The ball 
Us(\(p)) will be called e-t-halting for M if there is some 
\ip) G UsQip)) fl S n which is e-i-halting for M. Moreover, 
we use the following symbols: 

• dist(S', \<p)) :— inf se s II l s )~lv)ll for any subset 5 C TL n 
and \ip) G H n , 



:= {\tp) G H n | (e k \tp) eQ + iQ Vfc}, where 
{l e fe)}fe=i denotes the computational basis vectors ofH n , 
,o\ ._ _V±_ for every vector |^ & -H n \ {0}. 



The set of vectors with rational coordinates, denoted H®, 
will in the following be used frequently as inputs or outputs 
of algorithms. Such vectors can be symbolically added or 
multiplied with rational scalars without any error. Also, given 
\a), \b) G H®, it is an easy task to decide unambiguously 
which vector has larger norm than the other (one can compare 
the rational numbers || |a)|| 2 and || |6)|| 2 , for example). 

Now we are ready to state the main theorem of this 
subsection: 

Theorem 3.4 (Computable Approximate Halting Spaces): 
There is a classical algorithm that, given a classical 
description of a QTM M, integers n G No, t G N, and a 
rational parameter S > 0, computes a description of some 
subspace Tijl (i) C Tt n and a rational number e^™' 5 ' (f ) > 
with the following properties: 

. Almost-Halting: If G H [ ^' S \t), then is (20 5)-*- 
halting for M. 



Approximation: For every G H 
\^ {S) ) G H^' s \t) which satisfies 
Similarity: If 5, A G <™+ 
for every \ijj) G 



(t), there is a vector 



Hf% (t) which satisfies || 



H^' S \t) there is a vector |V> (A) ) G 



such that 5 < g|j e 



L -i"' A) (t),then 



IV> (A) )I 



H\T>{t) and |^ 



HYt'"'(t') for * ^ *', then it holds that \(ip t \^t')\ < 



Almost-Orthogonality: If \ip t 
j(n,5) 

4%/55. 

The description of this algorithm (Definition l3.7b and the proof 
of this theorem (on page [Tot need some lemmas that show how 
certain computational steps can be accomplished. 

Lemma 3.5 (Algorithm for e-t-halting-Property of Balls): 
There exists a (classical) algorithm B which, on input \<p) G 
H% 5, e G Q + , * G N and a classical description s M G {0, 1}* 
of a fixed-length QTM M, always halts and returns either 
or 1 under the following constraints: 

• If Us(\tp)) is not £- i-halting for M, then the output must 
be 0. 

• If U&{\ ip)) is |-*-halting for M, then the output must be 
1. 

Proof. The algorithm B computes a set of vectors 
{\Vk)}k=i C such that for every vector |^) G Ug(\ip)) n 
S n there is a k G {1, . . . , N} such that || \<p k ) ~ \ip)\\ < ^ e, 
and also vice versa (i.e. dist (Us(\<p)) fl S n , \<fik)) < ^ e for 
every k). 

For every k G {1, . . . , N}, the algorithm simulates the QTM 
M on input \cpk) classically for t time steps and computes an 
approximation a(i') of the quantity (qf\MQ(\(pk) (<Pk\)\<lf) f° r 
every *' < *, such that 

3 



o(0-(«/l-Mc(lv*)(v*l)l?/; 



for every *' < t . 



How can this be achieved? Since the number of time steps * 
is finite, time evolution will be restricted to a finite subspace 
Ht C Ht corresponding to a finite number of tape cells, 
which also restricts the state space of the head (that points 
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on tape cells) to a finite subspace Tin. Thus, it is possible 
to give a matrix representation of the time evolution operator 
Vm on Tic ® T^t <£> Tin, and the expression given above can 
be numerically calculated just by matrix multiplication and 
subsequent numerical computation of the partial trace. 

Every \(p k ) that satisfies \a(t') — 8 t 't \ < § £ for every t' < t 
will be marked as "approximately halting". If there is at least 
one \ip k ) that is approximately halting, B shall halt and output 
1, otherwise it shall halt and output 0. 

To see that this algorithm works as claimed, suppose that 
Us(\ip)) is not e-i-halting for M, so for every \tp) S Us(\(f)) 

> 



there is some t' < t such that 
e. Also, for every k G {1, . . . , 

Ug(\ip}) n S n with || \tp k ) - \l) 



8 t , t -{q f \M c (\i>)(i>\)\q f 
N}, there is some vector \ip) g 

< A £, SO 



A, 



> 



> 
> 

> 



St>t-{qf\M&(\tp k )(tpk\)\qf 

(qAM&immqf) - (q f \M c (\v>°kM\)\qf 
{q f \Mi{\ Vk ){^ k \)\q f ) - {q f \M c {Wl){ V l\)\q f 
£ -|||^)(^|-|^)<^|||T r -2.|l-|||^)|| 2 | 

£ -|||V)-|^)H-2|l-|| |^)|||(1 + || Wk)\\) 

3 I,, i iHiii 3 23 

^>-^>H-4-- e>-e t 

Thus, for 



64 



where we have used Lemma IA.3I and Lemma IAT 
every k it holds 



\a(t')-8 tH \ > 



> 



A fc 
23 
32 £ 



[q,\M&{\<Pk)(<Pk\)\qf) -atf) 



-£ 



32 



which makes the algorithm halt and output 0. 

On the other hand, suppose that U$(\ip)) is |-i-halting for 
M, i.e. there is some \tp) E Us(\<p)) n S n which is 
halting for M. By construction, there is some k such that 

ll^-MII ' - 
St>t-(Qf\M&{\<Pk){<Pk\)\qf 



a(t') 
1. 



Set 



< ^ e. A similar calculation as above yields 
< lie for every t' < t, so 

£, and the algorithm outputs 

□ 



32- 



Lemma 3.6 (Algorithm I for Interpolating Subspace): 
There exists a (classical) algorithm / which, on input M, N E 

N, \<pi), . . . , \$ M ),J<pi), . . . , \<pn) e H« d e N, Q+ 9 A > 

5 and Q + 9 A > S, always halts and returns the description 
of a pair (i, U) with i E {0, 1} and U C Tt n a Unear subspace, 
under the following constraints: 

• If the output is (1, U), then U C Ti n must be a subspace 
of dimension dimf/ = d such that dist(C7, l^fe)) < A 
for every k and dist(£/, \ (pi)) > 6 for every I, 

• If there exists a subspace U C TL n of dimension 
dim[/ = d such that dist(J7, l^fc)) < 8 for every fc and 
dist(C/, |<^;)) > A for every I, then the output must be of 
the@ form (1,U). 

The description of the subspace U is a list of linearly inde- 
pendent vectors {\ui)}f =1 E\H^C\U. 



Proof. Proving this lemma is a routine (but lengthy) exercise. 
The idea is to construct an algorithm that looks for such a 
subspace by brute force, that is, by discretizing the set of all 
subspaces within some (good enough) accuracy. We omit the 
details. □ 

We proceed by defining approximate halting spaces as 
the output of a certain algorithm. It will turn out that these 
spaces satisfy all the properties stated in Theorem 13.41 Note 
that the definition depends on the details of the previously 
defined algorithms in Lemma 13.51 and 13.61 (for example, there 
are always different possibilities to compute the necessary 
discretizations). Thus, we fix a concrete instance of all those 
algorithms for the rest of the paper. 

Definition 3.7 (Approximate Halting Spaces): 
We defin^l the (5-approximate halting space 7i^}' S (t) C TL n 
and the (5-approximate halting accuracy £^ ,Sl \t) E Q as the 
outputs of the following classical algorithm on input n, t E 
N, < 8 E Q and s M £ {0,1}*, where s M is a classical 
description of a fixed-length QTM M: 

(1) Let e := 18 8. 

(2) Compute a covering of S n of open balls of radius 8, that 
is, a set of vectors {|^i), . . . , \iPl)} CH® (L E N) with 
|| |Vfc)|| e (l - f ,1 + I) for every k E {1, . . . ,L} such 
that Sn C Uti U 6 {\ik)). 

(3) For every k E {1, ...,£}, compute B(\ij) k ), 8, s, t, sm) 
and B(\tl>f.)) <^ 18 8, t, sm), where B is the algorithm for 
testing the e-t-halting property of balls of Lemma 13.51 If 
the output is for every k, then output ({0}, e) and halt. 
Otherwise set for N 3 N < L and N 3 K < L 

{\Vi)}*Li ■= I B(\i> k ),8,e,t,s M ) = l}, 

:= Wfc> I B(\il> k ),8,185,t,8 M ) =0}. 

If N = 0, i.e. if the set {\ifii)}fLi i s empty, output 
({0},e) and halt. 

(4) Set d := 2™. 

(5) Let A := 28, A := |<5 and 8 := §5. 
Use the algorithm / of Lemma 13.61 to search 
for an interpolating subspace, i.e., compute 
I(K,N,\<pi),...,\^K)j\vi),--',\fNhd,A,S,A,S). If 
the output of / is (1, U), output ((/, ej and halt. 

(6) Set d := d— 1. If d > 1, then go back to step (5). 

(7) Set e :— | and go back to step (3). 

Moreover, let H { ^ 5 \t) := H^' 5) (t) n S„. 

The following theorem proves that this definition makes 



Theorem 3.8: The algorithm in Definition 13.71 always ter- 
minates on any input; thus, the approximate halting spaces 
TL^ S \t) are well-defined. 

Proof. Define the function e m in ■ S n — *■ Rq by e m i n {\^)) := 
inf{e > | is e-^-halting for M}. Lemma IA.3I and IA.5I 
yield 



< 



IV'l) - |-02> 



(9) 



U will then be an approximation of U. 



3 From a formal point of view, the notation should rather read H^ 1 ^ (t) 



instead of H^' S) '(*), since this space depends also on the choice of the 
classical description sjj of M . 
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so £ m in is continuous. For the special case H^y{t) = 0, it 
must thus hold that e min (S n ) := mm WeSn e min (\tp)) > 0. 
If the algorithm has run long enough such that e < e m i n (S n ), 
it must then be true that B(\ipk), 6, e, t, Sm) — for every 
k G {1, . . . , L}, since all the balls Ug(\ipk)) are not e-i-halting. 
This makes the algorithm halt in step (3). 

Now consider the case H^\t) ^ 0. The continuous 
function £ m in attains a minimum on every compact set 
U s (\ipk}) H S n , so let E fe := min w& u s ^ k})nSn e min (|^)) 
(1 < k < N). If £fc = for every k, then for every k and 
e > 0, there is some vector \if>) £ Us(\^k)) H S„ which 
is e-i-halting for M, so <5, e, t, Sj^f) = 1 for every 

£ > 0, and so = in step (3). Thus, the algorithm / will 
by construction find the interpolating subspace U = Ti n and 
cause halting in step (5). 

Otherwise, let e := minjefc | k £ {1, . . . , N}, £f- > 0}. 
Suppose that the algorithm has run long enough such 
that e < £ . By construction of the algorithm B, if 
B(\i/jk),S,£,t,SM) = 1, it follows that Us(\ipk)) is £ -t- 
halting for M, but then, £fc < e < £q, so = 0, so 

there is some lip) £ ^(IV 1 *)) ^ ^« which is 0-i-halting 

An) 



Jn,5) 



By definition of /, 



: holds 

dist(TC^' 0> (t), < A, and by elementary estimations it 



follows that dist(W^ ,5) (t) n S n , |^)) < | + 2A, so there is 
some |</> (5) > G such that || |V W > - 1^)11 < f + 2A. 

Since || — < 8 and A = 28, the approximation 

property follows. 

Notice that under the assumptions given in the statement 
of the similarity property, it follows from the almost-halting 
property that if \tjj} G H^ S) (t), then \tp) must be |e^ A) (t)- 



4 C M 



^-halting for M. Consider the computation of TCj}'"' if) by 
the algorithm in Definition 13.71 By construction, it always 
holds that the parameter e during the computation satisfies 



£ > £ 



M (*)> so W ^ S always f-i-halting for M, and if 



\tp) G UsQipj)), it follows that BQipj), 8, e, i, s M ) = 1. 
The rest follows in complete analogy to the proof of the 
approximation property. 

For the almost-orthogonality property, suppose 



\v) G H { ^' S \t') and \w) £ #M° ; (i) are two arbitrary 
qubit strings of length n with different approximate halting 
times t < t' £ N. There is some I £ {1, . . . , L} such that 

\w) £ UeQin)). so dist(W^' 5) (t),|^)) < « < I Since 



(n,<S), 



for M, so dist(WS ; (t), |-0fe» < 6. On the other hand, if 

B(\i; k ),S, 18 5, t, s M ) = 0, it follows that U 5 (\^ k )) is not ^'^'J,?^' ' '; ' I^Jf !>> \\ ' ' d ' ' 5 ' ' J 
(§£)-Mialting for M. Thus, dist ^^(t), 



according to (O, so dist(«V7(t) n 5 n , |^fe)) > 4<5, and 



by elementary estimations dist(7i]^ (t), > |<5. 

By definition of the algorithm /, it follows that 
I(K, N, . . . , \<p K ), . . • , |wv>, d, A, 5, A, 5) 
(1,17) for d := dimTC^ (t) > 1 and some subspace 
U C Tin, which makes the algorithm halt in step (5). □ 

We are now ready to prove Theorem 13.41 by showing that 
the approximate halting spaces defined above indeed satisfy 
the properties stated in that theorem. 



Proof of Theorem 



Assume that H 



(n,5) 
M 



(t) + 



£ H$'°>(t) c S n , and let . . . , \tp L }} C 



Let | 

Ti n be the covering of S n from the algorithm in Defini- 
tion [33] By construction, there is some k £ {1, ...,L} 
such that £ Us{\4>k})- The subspace Ti^/ S \t) is 



(l,H ( M' S) (t)) at step (5) of the computation of TL^' 5) {t), 
it follows from the definition of I that there is no m £ N 
such that | ipi) = \<p m ) for the sets defined in step (3) 
of the algorithm above. Thus, B(\ipi), 6, 18 5, t, sm) — 1> 
and by definition of B it follows that Us(\ipi)) must 
be (18 (5)-£-halting for M, so there is some vector 
\w) £ Us(ipi)) n S n which is (18 <5)-i-halting for M and 
satisfies || \w) - \w)\\ < \\ \w) - \ipi)\\ + \\ ~ \w)\\ < 26. 
Analogously, there is some vector \v) £ S n which is 
(18 5)-f -halting for M and satisfies || \v) - \v)\\ < 26. 

From the definition of the trace distance for pure states (see 
lfT8l (9.99)] and of the e-i-halting property in Definition [ 
together with Lemma IA.3I and Lemma IA.5I it follows that 



computed in step (5) of the algorithm in Definition 13.71 

via I(K, N,\f>i), . . . , \ <Pk)i • • • j Wn), d, A, <5, A, 5) = 
(1,H { m S) (t)), and since dist(?i^' 5) (t), |^fc» < 6, it follows 
from the properties of the algorithm / in Lemma 13.61 that 
l^fc) ^ \(pi) for every I £ {1, . . . , K} in step (3) of the algo- 
rithm. Thus, B(\ipk) ,6,18 5,t, sm) = 1, and it follows from 
the properties of the algorithm B in Lemma [331 that Ug(\ipk)) 
is (18 <5)-i-halting for M, so there is some £ Us(\ipk))rtS n 
which is (18 <5)-t-halting for M. Since || |<i/>) - < 26, the 
almost-halting property follows from Equation ©■ 

To prove the approximation property, assume that 
H M*{t) ^ 0. Let |^) £ H${t) C S n \ again, there is some 
j G {1, . . . , L} such that \ip) £ U s (\i/)j)), so U s (\ipj)) is 0-t- 
halting for M, and B(\ipj) ,8,£,t, sm) = 1 for every e > 
by definition of the algorithm B. For step (3) of the algorithm 
in Definition 13.71 it thus always holds that \ipj) £ {\fi)}f = i- 
The output of the algorithm is computed in step (5) via 
I(K, N, \(p x ), \<p K ), |vi>, - ■ ■ , \<Pif),d, A, 5, A, 6) 



1-\(W\V)\ 2 = || \W)(W\ - \V)(V\ Hxr 

II \&){w\ - \v)(v\ \\ T v 

-II \w)(w\ - \w)(w\\ || Tr 
-|||«)(«|-|fi>(«||k 

\(q/\Mhm(^\)\Q/) 

-(«/|Jl4(l«X«l)k/>| 
-|| \w)- \w)\\ -|| 
1 - 36 6 - 26 - 26 = 1 - 40 6. (10) 



> 



> 



> 



This proves the almost-orthogonality property. 



□ 



The following corollary proves that the approximate halting 
spaces TC^' S \t) are "not too large" if 8 is small enough. 

Corollary 3.9 (Dimension Bound for Halting Spaces): 
If 8 < ± 2~ 2n , then ^ dim (t) < 2". 

ten 



Proof. Suppose that YlteN ( ^ m 'H-M 



(n,6) 



(t) > 2". Then, choose 



orthonormal bases in each of the spaces Ti { M' 5 \t), and let 

"C 1 V») }-* =i~ ^ e ^ e un io n of the first 2" + 1 of these basis vec- 
tors. By construction and by the almost-orthogonality property 
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of Theorem [3741 it follows that \{<Pi\ipj}\ < 4^56 < 2~ n = 
(2"+i)-i f° r ever y * 7^ j- Lemma |A~T1 yields dim U > 2™ + 1 
for U := span{|</Jj)} i= ^ C H n , but dimH n = 2™, which is 
a contradiction. □ 

C. Compression, Decompression, and Coding 

In this subsection, we define some compression and coding 
algorithms that will be used in the construction of the strongly 
universal QTM. 

Definition 3.10 (Standard (De-)Compression): 
Let U C H n be a linear subspace with dim U = N. Let 
Pu G B(H n ) be the orthogonal projector onto U, and let 
{l e i)}?=i be the computational basis of H n - The result of 
applying the Gram-Schmidt orthonormalization procedure to 
the vectors {|'Ui)L_ 1 = {Pu\&i)}i—i (dropping every null 
vector) is called the standard basis {\u\), . . . , \un)} of U. Let 
\fi) be the i-th computational basis vector of Wn g/y-i. The 
standard compression Cjj : U —* "Hn gN] is then defined by 
linear extension of Cjj{\ui)) := \ fi) for 1 < i < N, that is, Cjj 
isometrically embeds U into Hn og m . A linear isometric map 
T^U '■ Hn 0& m — > will be called a standard decompression 
if it holds that 

T>u o Cu = lu ■ 

It is clear that there exists a classical algorithm that, given a de- 
scription of U (e.g. a list of basis vectors {K)}fi m!7 C H®), 
can effectively compute (classically) an approximate descrip- 
tion of the standard basis of U. Moreover, a quantum Turing 
machine can effectively apply a standard decompression map 
to its input: 

Lemma 3.11 (Q-Standard Decompression Algorithm): 
There is a QTM D which, given a descriptioryof a subspace 
U C H n , the integer n G N, some 5 G Q + , and a quantum 
state G H [log dim [71 1 outputs some state \ip) G 7i n with the 
property that || \<p) — T>xj\ip)\\ < S, where T>u is some standard 
decompression map. 

Proof. Consider the map A : W[i og dim[7l — * H n , given 
by A\v) := |o)«>("- n°s dim [/l ) ^ The map A prepends 

zeroes to a vector; it maps the computational basis vectors of 
H n g dim [71 to the lexicographically first computational basis 
vectors of H n - The QTM D starts by applying this map A to 
the input state \tp) by prepending zeroes on its tape, creating 
a state |V>) := |o)®("-P°sdim[T|) ® |^) g ft n . 

Afterwards, it applies (classically) the Gram-Schmidt 
orthonormalization procedure to the list of vectors 
. . . , \udimu), |ei), ■ ■ ■ , |e 2 »)} C 7Y^, where the 
vectors {\ui)}fl^ u are the basis vectors of U given in the 
input, and the vectors {|ei)}| =1 are the computational basis 
vectors of H n - Since every vector has rational entries (i.e. 
is an element of H®), the Gram-Schmidt procedure can be 
applied exactly, resulting in a list {|wj)}| =1 of basis vectors 
of H n which have entries that are square roots of rational 
numbers. Note that by construction, the vectors 
are the standard basis vectors of U that have been defined in 
Definition IXTOl 

4 (a list of linearly independent vectors { ) , . . . , |«dim(y)} C U PI H?) 



Let V be the unitary 2™ x 2™-matrix that has the vectors 
{|tii)}f = i as its column vectors. The algorithm continues by 
computing a rational approximation V of V such that the 
entries satisfy \V tj - V tj \ < 2n+ i {1 Q^ )2 ™ . and thus, in 
operator norm, it holds II y — VII < -; — ? — r^r- Bernstein 

'MM 2(10V2") 2 

and Vazirani J5] Sec. 6] have shown that there are QTMs 
that can carry out an e-approximation of a desired unitary 
transformation V on their tapes if given a matrix V as input 
that is within distance — — . of the d x d-matrix V . This 

2110Vd) d 

is exactly the case herqj, with d = 2™ and e = S, so let the 
V apply V within 5 on its tape to create the state \<p) G H n 
with || l^) - V\i>}\\ = || \ip) - V o A\ip)\\ < 5. Note that the 
map V o A is a standard decompression map (as defined in 
Definition I3.101 l. since for every i E {1, . . . , dim U} it holds 
that 

VoAoCu\ Ui ) = VoA\fi) = V|e;) = \ Ui ) , 
where the vectors |/j) are the computational basis vectors of 

H [log dim [71 ■ D 

The next lemma will be useful for coding the "classical part" 
of a halting qubit string. The "which subspace" information 
will be coded into a classical string q G {0, 1}* whose length 
ii G No depends on the dimension of the corresponding 
halting space H^j S \ti). The dimensions of the halting spaces 
(dim H ( Z' 5) (ti ), dim H^ 5) (t 2 ),...) can be computed one 
after the other, but the complete list of the code word lengths 
ti is not computable due to the undecidability of the halting 
problem. Since most well-known prefix codes (like Huffman 
code, see |22|) start by initially sorting the code word lengths 
in decreasing order, and thus require complete knowledge of 
the whole list of code word lengths in advance, they are 
not suitable for our purpose. We thus give an easy algorithm 
that constructs the code words one after the other, such that 
code word a depends only on the previously given lengths 
£i, £2, ■ ■ ■ , £%■ We call this "blind prefix coding", because code 
words are assigned sequentially without looking at what is 
coming next. 

Lemma 3.12 (Blind Prefix Coding): 
Let {£i}fL 1 C No be a sequence of natural numbers (code 

N 

word lengths) that satisfies the Kraft inequality 2~ li < 1. 

i=i 

Then the following ("blind prefix coding") algorithm produces 
a list of code words {cj}f =1 C {0, 1}* with £(a) = ti, such 
that the i-th code word only depends on £j and the previously 
chosen codewords c%, . . . , Ci-\. 

• Start with c\ :— O^ 1 , i.e. c\ is the string consisting of £\ 
zeroes; 

• for i — 2, . . . , N recursively, let Cj be the first string in 
lexicographical order of length £(cj) = £\ that is no prefix 
or extension of any of the previously assigned code words 

Cl, .. . , Cj_l. 

Proof. We omit the lengthy, but simple proof; it is based 
on identifying the binary code words with subintervals of 

5 Note that we consider H„ as a subspace of an ra-cell tape segment Hilbert 
space (C^ ' 1 '*})®", and we demand V to leave blanks |#) invariant. 
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[0, 1) as explained in |[T3l . We also remark that the content 
of this lemma is given in l22l Thm. 5.2.1] without proof as 
an example for a prefix code. □ 

D. Proof of the Strong Universality Property 

To simplify the proof of Main Theorem ll.il we show now 
that it is sufficient to consider fixed-length QTMs only: 

Lemma 3.13 (Fixed-Length QTMs are Sufficient): 
For every QTM M, there is a fixed-length QTM M such that 
for every p £ (W{q,i}« ) there is a fixed-length qubit string 

P e U„ eN o r i + (^™) such that M (<°) = M(p) and £{p) < 
£(p) + 1. 

Proof. Since dim7Y<„ = 2 n+1 — 1, there is an isometric 
embedding of H< n into Ti n +i- One example is the map V n , 
which is defined ~as K|e 4 ) := \fi) for i £ {1, . . . , 2 n+1 - 1}, 
where | e^) and denote the computational basis vectors 
(in lexicographical order) of Tt< n and H n +i respectively. As 
H n+ i C W<(„ + i) and U< n C W<( n+1 ). we can extend 
V n to a unitary transformation U n on H<( n+1 ), mapping 
computational basis vectors to computational basis vectors. 

The fixed-length QTM M works as follows, given some 
fixed-length qubit string p £ (fi„+i) on its input tape: 
first, it determines n + 1 = £(p) by detecting the first blank 
symbol Afterwards, it computes a description of the unitary 
transformation U* and applies it to the qubit string p by 
permuting the computational basis vectors in the (n+ 1) -block 
of cells corresponding to the Hilbert space Act ' 1 '*!) 8 " +1 . 
Finally, it calls the QTM M to continue the computation on 
input p := U* p U n . If M halts, then the output will be M{p). 

□ 

Proof of Theorem 11.11 First, we show how the input cm 
for the strongly universal QTM il is constructed from the input 
a for M. Fix some QTM M and input length n £ No, and let 



£0 



81 Z 



2d 



Define the halting time sequence {t { M 



as the set of all integers t £ N such that dimH^' e °\t) > 1, 
ordered such that t^' (i) < vJr (i+ 1) for every i. The number 
TV is in general not computable, but must be somewhere 
between and 2™ due to Corollary 13.91 

For every i £ {1, . . . ,N}, define the code word length 
£) as 



JM,n) 



1 - 



logdimWi™'^ 



(*$?(<) 



This sequence of code word lengths satisfies the Kraft inequal- 
ity: 



N 
i=l 



JV 



= 2 



< 2" 



= 2~ 



JV 



5 dim 7-{ 



(n,e ) 
M 



^dimW 

^dimH ( "' £o) 



A/ 



(*) < 1 » 



where in the last inequality, Corollary 13.91 has been used. Let 
j c ( M ,™) | ^ e ^ blind prefix code corresponding 



to the sequence |^ M ' n ) j which has been constructed in 
Lemma 13.121 

In the following, we use the space as some kind 

of "reference space" i.e. we construct our QTM il such that 
it expects the standard compression of states \ijj} £ Ti!^f eo \t) 
as part of the input. If the desired accuracy parameter 6 is 
smaller than eq, then some "fine-tuning" must take place, 
unitarily mapping the state £ (t) into halting 

spaces of smaller accuracy parameter. In the next paragraph, 
these unitary transformations are constructed. 

Recursively, for k £ N, define e k := ^m^'^ '(*)■ 



Since e 



(n,S) 



M 



(*) 



< 18(5 by construction of the algorithm in 

Definition 13.71 we have e k < (|§) fc ■ £o *— 0. It follows 
from the approximation property of Theorem 13.41 together with 
Lemma[Qlthat dim H^f (t) > dimW^ 3 (t). The similarity 
property and Lemma [A4l tell us that A\m.W^ ek ^ 1 \t) > 
dimTi. jl (t) for every k £ N, and there exist isometries 



satisfy 

\\u k - 



(*) 



(t) that, for k large enough, 



<3 



8 /ll 



1 o 



Let now d 

min j k £ N | 



lim fe _ 



2 " /18 
< const„ • I — 

&miH { M' ek) {t) and c 



(ID 



dim 



transformations U k (they are not unique), let 



i|. For any choice of the 



(*) 



U k +\U k+ 2 ■ ■ ■ U C H 
(*) 



H 



M 



(*) 



if k < c , 
if k > c . 



It follows that the spaces Hi^ k) {t) all have the same dimen- 



sion for every k £ N , and that H { ^ £k) (t) C H 



,(n,e k ) 



Define the unitary operators U k := U k \ H^l' Sk \t), then 

1^2 



1|| < \\U k - 1||, and so the sum J2T=i \\ U k 
converges. Due to Lemma IA.2I the product U := Ylt 



converges to an isometry U : 7i^' eo \t) — > TL n - It fol- 



lows from the approximation property in Theorem 13.41 that 
Tijl (t) C ran(f/), so we can define a unitary map U^ 1 : 
ran(£/) n { ^' Eo) (t) by U'^Ux) := x, and C 

dom(C/- 1 ). 

Due to Lemma 13.131 it is sufficient to consider fixed-length 
QTMs M only, so we can assume that our input a is a fixed- 
length qubit string. Suppose M(a) is defined, and let r £ N be 
the corresponding halting time for M. Assume for the moment 
that a = \ip)(ip\ is a pure state, so \ip) £ H^\t). Recall the 
definition of the halting time sequence; it follows that there is 
some i £ N such that r 



*&>(*). Let 



\^ M ' n) ) ■= I 



(T) 



that is, the blind prefix code of the halting number i, followed 
by the standard compression (as constructed in Definition l3.10l l 
of some approximation U^ 1 ^) of \if>) that is in the subspace 
H { m' £o) (t). Note that 

/(|^.«)>) = ^(c^)+^(C w ^ 0)(r) f/-V)) 



= £ 



(M,n) 



logdimH^ £o) (T) 
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If <7 = Afe|i/ , fe)('0fe| is a mixed fixed-length qubit string 
which is r-halting for M, every convex component \i[>k) 
must also be r-halting for M, and it makes sense to define 
a (M,n) . = J2k^\4 M - n) )(4 M,n) \, where every \^[ M ' n) ) 
(and thus a^ M ' n ^) starts with the same classical code word 



S M > n ) 



and still cr( M >") £ T+{H n+ i) 



The strongly universal QTM il expects input of the form 



, Sm 



JM,n) 



(12) 



where sm £ {0, 1}* is a self-delimiting description of the 
QTM M, We will now give a description of how il works; 
meanwhile, we will always assume that the input is of the 
expected form ( fl2b and also that the input a is a pure qubit 
string (we discuss the case of mixed input qubit strings 

a afterwards): 

• Read the parameter S and the description sm- 

• Look for the first blank symbol # on the tape to deter- 
mine the length ^(cj( M ' n )) = n + 1. 

• Compute the halting time r. This is achieved as follows: 

(1) Set t := 1 and i := 0. 



(n,e ) 



(2) Compute a description of Tlm" 
dim r4"' £o) (t) = 0, then go to step (5). 



(t). 



If 



(3) Set 



I and set lf' Un) := n + 1 

From the previously com- 



logdimr^ £o) (t) 



puted code word lengths £ 



(M,n) 



(1 < j < i), 



compute the corresponding blind prefix code word 
c\ M,n \ Bit by bit, compare the code word c| M '™' ) 
with the prefix of a^ M ' n \ As soon as any difference 
is detected, go to step (5). 

(4) The halting time is r := t. Exit. 

(5) Set t := t + 1 and go back to step (2). 

Let \ip) be the rest of the input, i.e. 

a[ M ^ =: \4 M ' n) )(4 M ' n) \ ® |$$| (thus 
e C„,(™,en), with some irrelevant phase 



M 



£ M). Apply the quantum standard decompression 
algorithm D given in Lemma 13.1 II i.e. compute 



\<p) :=S)(w^ 0) (r),n,|, 



Then. 



'Mi 



= |||^)-C7- 1 |^)||< I 



Compute an approximation V : 7i n 
unitary extension of U with 



8/3 



u-v 



H n of a 

!y(n,eo) 
M 



< 



h — - ,„ =: e, where {/ is some "fine-tuning map" 
as constructed above. This can be achieved as follows: 



- Choose N 



N 

'1S\ 



large enough such that 

J2T=n+i const„ • (||) 2 < §, where const„ £ K 
is the constant defined in Equation dTTb . 
- For every k £ {1, . . . , A^}, find matrices Vk '■ H n — > 
7i„ that approximate the forementionec^ isometries 

6 The isometi'ies (7 ^ are not unique, so they can be chosen arbitrarily, except 
for the requirement that Equation (TTJ is satisfied, and that every depends 
only on fi.^' ek ^ (t) and H^ I ' £k ~ 1 \t) and not on other parameters. 



N 



(t) -> n { M £k - l] (t) such that 



iV 



fc=i 



k=l 



< 



Setting V ■= Yik=i ^fe* w ^ work as desired, since 



il'V ' 



fc=l 



< 



fe=JV+l 



< 



< > const,, 

fc=W+l v 7 

due to Equation ( fTTT i and the proof of Lemma IA.2I 
Use V to carry out a | -approximation of a unitary 
extension U of J7 on the state |<^) on the tape (the 
reason why this is possible is explained in the proof 
of Lemma [3 . 1 1 b - This results in a vector \<p) with the 
property that || \tp) - U\0)\\ < |. 

Simulate M on input \(fi)(ip\ for r time steps within an 
accuracy of |, that is, compute an output track state po £ 

T+(fto) with ||po - ^"odvXvDIlTr < f. move this 
state to the own output track and halt. (It has been shown 
by Bernstein and Vazirani in [ 3 1 that there are QTMs that 
can do a simulation in this way.) 



Let (7m 



sm 



JM,n) 



Using the contractivity of the trace 



distance with respect to quantum operations and Lemma IA. 31 
we get 

\\il(a M> 6) - M(^)(^|)|| Tr = 

= \\K{po)-n{M T {\^)m)\\Tr 

< llpo-Afgfl^Dlk 

+ ||M5(| ¥3 )V|)-M5(|^(V|)lk 



< - 



< - 



< 



5 

3 
6 

3 
6 

3 

Is- 

3 



\<p)-U\<p)\\- 

\<p)-u*W) 



\um-m\ 



< 6 



This proves the claim for pure inputs a = If 
a = X)fc / ^fclV , fe)(' ( /'fe| is a mixed qubit string as explained 
right before Equation ( fT2l . the result just proved holds for 
every convex component of a by the linearity of M, i.e. 
\\pk — M(\tpk) (^feDllxr < an< ^ me assertion of the theorem 
follows from the joint convexity of the trace distance and the 
observation that il takes the same number of time steps for 
every convex component □ 

This proof relies on the existence of a universal QTM U in 
the sense of Bernstein and Vazirani as given in Equation (fl}. 
Nevertheless, the proof does not imply that every QTM that 
satisfies (Q~|) is automatically strongly universal in the sense of 
Theorem II. U for example, we can construct a QTM U that 
always halts after T simulated steps of computation on input 
(sm, T, 6, and that does not halt at all if the input is not 
of this form. So formally, 

{U QTM universal by dTJ} 2 {il QTM strongly universal}. 
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Proposition 3.14 (Parameter Strongly Universal QTM): 
There is a fixed-length quantum Turing machine il with 
the property of Theorem 11.11 that additionally satisfies the 
following: For every QTM M and every qubit string a £ 

1\ (^{o,i}*)' there is a q ubit strin 8 °M e T\ (^{0,1}*) 
such that 

||H (cr M , k) - M (a, 2k)\\ Ti < -!- for every k £ N 

if M(a, 2k) is defined for every k £ N, where the length of 
(7m is bounded by £{<tm) < ^(c) + cm, and cm G N is a 
constant depending only on M. 

One might first suspect that this proposition is an easy 
corollary of Theorem 11.11 but this is not true. The problem 
is that the computation of M(a, k) may take a different 
number of time steps r for different k (typically, r — > oo 
for k — > oo). Just using the result of Theorem 11.11 would give 
a corresponding qubit string <jm that depends on k, but here 
we demand that the qubit string cfm is the same for every 
k, which is important for the proof of Theorem 11.21 to fit the 
definition of QC. 

Thus, we have to give a new proof that is different from 
the proof of Theorem 11.11 Nevertheless, the new proof relies 
essentially on the same ideas and techniques; for this reason, 
we will only sketch the proof and omit most of the details. 

The proof sketch is based on the idea that a QTM which 
is universal in the sense of Bernstein and Vazirani (i.e. as in 
Equation fl}) has a dense set of unitaries that it can apply 
exactly. We can call such unitaries on ri n for n £ N il-exact 
unitaries. 

This follows from the result by Bernstein and Vazirani that 
the corresponding UQTM U can apply a unitary map U on its 
tapes within any desired accuracy, if it is given a description of 
U as input. It does so by decomposing U into simple ("near- 
trivial") unitaries that it can apply directly (and thus exactly). 

We can also call an n-block projector P £ B(TL n ) il-exact 
if it has some spectral decomposition P = Yli\' t Pi)(' l Pi\ 
such that there is a il-exact unitary that maps each \ipi) to 
some computational basis vector of Ti n . If P and 1 — P 
are il-exact projectors on Ti n , then il can do something like 
a "yes-no-measurement" according to P and 1 — P: it can 
decide whether some vector \ip) £ fi n on its tape is an 
element of ranP or of (ranP)- 1 with certainty (if either one 
of the two cases is true), just by applying the corresponding 
il-exact unitary, and then by deciding whether the result is 
some computational basis vector or another. 

Proof Sketch of Proposition 13.141 In analogy to Defini- 
tion [XT] we can define halting spaces 7ij^(ti,ta, . . . ,tj) as 
the linear span of 

H^\tx,t 2 ,...,tj):={\i>) £ri n | is ^-halting 

for M (1 < % < j)}. 

Again, we have < (feJLi) J_ H# ($)J = i) if t ? 
t', and now it also holds that (tx, ■ ■ . ,tj,t j+i) C 
Hl9(ti, ...,tj) for every j £ N. Moreover, we can define 
certain (^-approximations Ti^^iti, ■■■ ,tj). We will not get 



into detail; we will just claim that such a definition can be 
found in a way such that these (^-approximations share enough 
properties with their counterparts from Definition |3.7| to make 
the algorithm given below work. 

We are now going to describe how a machine il with the 
properties given in the assertion of the proposition works. 
It expects input of the form (fc, / ® sm <8 a^ M,n '), where 
/ £ {0,1} is a single bit, sm £ {0,1}* is a self-delimiting 
description of the QTM M, cr (M ' n) £ T+(H {0il} ,) is a 
qubit string, and k £ N an arbitrary integer. For the same 
reasons as in the proof of Theorem ll.il we may without loss 
of generality assume that the input is a pure qubit string, so 
CT (M,n) = ^(M,n)^^(M,n)y Moreover, due to Lemma EQ] 
we may also assume that M is a fixed-length QTM, and so 
a (M,n) e T^(H n ) is a fixed-length qubit string. 

These are the steps that il performs: 

(1) Read the first bit / of the input. If it is a 0, then proceed 
with the rest of the input the same way as the QTM that 
is given in Theorem 11.11 If it is a 1, then proceed with 
the next step. This ensures that the resulting QTM il will 
still satisfy the statement of Theorem 11.11 

(2) Read sm, read k, and look for the first blank symbol # 
to determine the length n := £(er( M '™)). 

(3) Set j := 1 and 6q £ <Q> + (depending on n) small enough. 

(4) Set t := 1. 

(5) Compute (tj., . . . , Tj-\, t). Find a il-exact projec- 
tor P^\t\, . . . , Tj-i, t) with the following properties: 

. P^ ] (n , . . . , Tj -x , t>) ■ P$ ( n , . . . , 73-1, t) = for 
every 1 < t' < t, 

. P$\T 1 ,...,T j -ut)<P$\n,...,T j - l ), 

• the support of Pj£ (ti, ■ ■ ■ , T j-i,t) is close enough 

(6) Make a measurement described by 
P^ ) (T 1 ,...,T j - 1 ,t). If |V> (M ' n) ) is an element of 
the support of P^ (ri, . . . ,Tj_i,f), then set Tj := t and 
go to step (7). Otherwise, if \ip( M ' n )) is an element of 
the orthogonal complement of the support, set t := t + 1 
and go back to step (5). 

(7) If j < 2k, then set j := j + 1 and go back to step (4). 

(8) Use a unitary transformation V (similar to the transforma- 
tion V from the proof of Theorem I l.U to do some "fine- 
tuning" on r,n )) , i.e. to transform it closer (depending 
on the parameter k) to some space Hy (ti, ... ,ta) D 
Ttj^ (ti, ...,Tj) containing the exactly halting vectors. 
Call the resulting vector \^ M ^) := V\^ {M ^). 

(9) Simulate M on input (\^ M ^){^ M > n \2k\ for r 2fc 
time steps within some accuracy that is good enough, 
depending on k. 

Let 7ij" A) (tx,...,tj) be the support of P { A f (tx,..., tj). 
These spaces (which are computed by the algorithm) have the 

7 It is not really a measurement, but rather some unitary branching: 
if ip( M ' n '} is some superposition in between both subspaces W := 
supp (pffi (ti,..., Tj-x, tj\ and VK X , then the QTM will do both possible 
steps in superposition. 
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properties 

nt M ({u)U) J- h^ o) («-)?=i) ift^f, 

H§ 5o \tu tj,t j+1 ) c Ht M \tu tj) Vj e N, 

which are the same as those of the exact halting spaces 
TL M \t\, ...,tj). If all the approximations are good enough, 
then for every £ H^\ti, . . . ,tj) there will be a vector 

|V> (M ' n) > € wj£ ,4o) (ii,...,ij) such that || |V>)-V|^ M '"))|| is 
small. If this |?/>( M '™)) is given to il as input together with all 
the additional information explained above, then this algorithm 
will unambiguously find out by measurement with respect to 
the il-exact projectors that it computes in step (5) what the 
halting time of is, and the simulation of M will halt after 
the correct number of time steps with probability one and an 
output which is close to the true output M(a, 2k). □ 



Proof of Theorem 11.21 First, we use Theorem 11.11 to 
prove the second part of Theorem 11.21 Let M be an arbitrary 
QTM, let It be the ("strongly universal") QTM and c M the 
corresponding constant from Theorem ll.il Let £ := QC M (p), 
i.e. there exists a qubit string a E T L + (7Y{ .i}*) with £( (J ) = £ 
such that 

||M(<t) - pHtt < S . 

According to Theorem 11.11 there exists a qubit string <tm £ 

7"i + (W{ ,i}») with £(<tm) < £{&) + c M = £ + c M such that 

||il(cr M , A - 8) - M(a)\\ Tl <A-S. 

Thus, \\ii(a M ,A - S) - p\\ T r < A, and £(a M ,A - S) = 
£(a M ) + ^(A - 5) < £ + c M + cs.a, where c s ^a € N is 
some constant that only depends on S and A. So QC^(p) < 

£ + C M ,S,A- 

The first part of Theorem [L2] uses Proposition [3T4] Again, 
let M be an arbitrary QTM, let il be the strongly univer- 
sal QTM and cm the corresponding constant from Proposi- 
tion [3T4] Let £ :— QCm(p), i-e- there exists a qubit string 
cr e T^(H{ with £(a) = £ such that 

II M (a, k) - p||tt < r for evel T k eN . 
k 

According to Proposition l3.14l there exists a qubit string <tm € 

7i + (W{ ,i}*) with £(a M ) < £{a) + c M =£ + c M such that 

||il (<t m , k) - M (a, 2k) || Tr < ^ for every k e N . 

Thus, \\U(a M ,k) - p\\Tr < \\tt(<j M ,k) - M(a,2k)\\ Tl + 
\\M(a,2k) - pWtt < ± + ^ = i for every k G N. So 
QC ix {p)<£ + c M - □ 

The construction of it is based to a large extent on classical 
algorithms that enumerate halting input qubit strings. Since it 
is in general impossible to decide unambigously by classical 
simulation whether some input qubit string \ip) is perfectly or 
only approximately halting for a QTM M, the UQTM il will 
also give some outputs of M which correspond to inputs that 
are only approximately halting. 



With some effort, this observation can be used to generalize 
the construction of il such that it also captures every e-halting 
input qubit string for M if e > is small enough, and gives 
the corresponding output. This leads to the following stability 
result. A proof and a more detailed reformulation can be found 
in El. 

Theorem 3.15 (Halting Stability): For every 5 > 0, there is 
a computable sequence a n (5) of positive real numbers such 
that every qubit string of length n which is a n (5) -halting for 
a QTM M can be enhanced to another qubit string which is 
only a constant number of qubits longer, but which makes il 
halt perfectly and gives the same output up to trace distance 
5. 

IV. Summary and Perspectives 

While Bernstein and Vazirani [3| have defined QTMs with 
the purpose to study quantum computational complexity, it 
has been shown in this paper that QTMs are suitable for 
studying quantum algorithmic complexity as well. As proved 
in Theorem 11.11 there is a universal QTM il that simulates 
every other QTM until the other QTM has halted, thereby 
even obeying the strict halting conditions that the control is 
exactly in the halting state at the halting time, and exactly 
orthogonal to the halting state before. 

Although the calculations in this paper were done for the 
QTM, it seems plausible that this construction of a "strongly 
universal" machine can be easily extended to other models 
of quantum computation as well. The only assumption is that 
the quantum computing device in question computes until it 
attains some halting state, dependent on the quantum input. 

In analogy to the classical situation, this makes it possible 
to prove that quantum Kolmogorov complexity depends on the 
choice of the universal quantum computer only up to an addi- 
tive constant, as shown in Theorem II. 21 In the classical case, 
this "invariance property" turned out to be the cornerstone for 
the subsequent development of every aspect of algorithmic 
information theory. We hope that the results in this paper will 
be similarly useful for the development of a quantum theory 
of algorithmic information. 

There are some more aspects that can be learned from the 
proofs of Theorems 11.11 and 11.21 One example is Lemma [3. 131 
which essentially states that indeterminate-length QTMs are no 
more interesting then fixed-length QTMs, if the length £(a) 
of an input qubit string a is defined as in Definition 12.11 This 
supports the point of view of Rogers and Vedral ifTTl to con- 
sider the average length 1(a) instead, that is, the expectation 
value of the length £. If the halting of the underlying quantum 
computer is still defined as in this paper, then our result applies 
to their definition, too. 

The construction of the strongly universal QTM il in the 
proof of Theorem 11.11 is such that il starts with a completely 
classical computation, followed by the application of classi- 
cally selected unitary operations. But the same steps (on the 
same input) can be done by a machine that has a purely 
classical control, selecting at each step of the computation a 
unitary transformation that is applied to an unknown quantum 
state (that was part of the input) without any measurement. 
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Thus, it seems that at least from the point of view of quantum 
Kolmogorov complexity QC S , it is sufficient to consider 
machines with a completely classical control. Such machines 
do not have the problem of "approximate halting" described 
in Subsection II- Al 

There may be interesting applications of extending algorith- 
mic information theory to the quantum case. One exciting per- 
spective is that in a quantum theory of algorithmic complexity, 
both the inherent notions of "randomness" of quantum theory 
and "algorithmic randomness" originating from undecidability 
results will occur (and maybe be related) in a single theory. 
One possible application of quantum Kolmogorov complexity 
might be to analyze a fully quantum version of the thought 
experiment of Maxwell's demon in statistical mechanics, since 
its classical counterpart has already proved useful for the 
corresponding classical analysis (cf. fHl ). 

Appendix 

Lemma A.l (Inner Product and Dimension Bound): 
Let Ti be a Hilbert space, and let |i/'i), ■ ■ ■ , \iPn) € % with 
|| |^)|| = 1 for every i G {1, . . . ,N}, where 2 < N G N. 
Suppose that 



< 



i 



N - 1 



for every i ^ j 



Then, dimH > N. 

Proof. We prove the statement by induction in N e N. For 
N = 2, the statement of the theorem is trivial. Suppose the 
claim holds for some N > 2, then consider N + 1 normalized 
vectors . . . , |"0JV+i) € 7^ where H is an arbitrary Hilbert 
space. Suppose that \(ipi\'4>j) \ < jt f° r every i j. Let P := 
1 - \ip N +i) (ipN+l\, then P\ipi) ^ for every i G {1, . . . , N}, 
and let 



Wi) := pm 



\9>. 



14) 

www ■ 



The \ifii) are normalized vectors in the Hilbert subspace H := 
ran(P) of H. Since || |^)|| 2 = (^|^) - \(Hi'N+i)\ 2 > 1- 



N 



1 it follows that the vectors \cpi) have small inner product: 



Let i 7^ j, then 



WM)\ 



< 



< 



II 14)11 -II K>ll 



1 



1 - 
1 

N 



1 

TP 



1 



1 

TP 



1 



1 



1 _ ^ \ AT ■ iV^ / A" 
Thus, dimW > A/', and so dimH > N + 1. 



□ 



Lemma A.2 (Composition of Unitary Operations): 
Let 7i be a finite-dimensional Hilbert space, let (I^)igN be a 
sequence of linear subspaces of H (which have all the same 
dimension), and let Ui : Vi — > Vi+i be a sequence of unitary 
operators on H such that X^fcLi \\Uk ~ 1|| exists. Then, the 
product IlfcLi ^ — . . . • U3 • U~2 • Ui converges in operator- 
norm to an isometry {/ : Vi — ► W. 



nf =1 ^-i 



< 



Proof. We first show by induction that 

SfcLi II ~ This is trivially true for 2V = 1; suppose it 
is true for A" factors, then 



N+l 
k=l 



< 



< 



< 



N+l 



N 



l[u k -l[u k 



k=l 



fc=l 



N 



N 



N+l 



i) n ^ 



fe=i 



fe=i 

AT 



/c=i 



N+l 

Ei 



By assumption, the sequence a n :— J22=i W^k ~ 1|| is a 
Cauchy sequence; hence, for every e > there is an N £ G N 
such that for every L,N> N £ it holds that £a.=l+i ||t4 - 
1|| < e. Consider now the sequence V n :— rife=l ^fc- ^ — 
X > AT., then 



WVn-V l \\ 



N L L 

n u k i[u k ii u k 

k=L+l 



k=l 



k=l 



< 



< 



N L 

n Uk - 1 n Uk 

k=L+l k=l 
N 

E ll^-l|l<£: 
k=L+l 

so (Ki)neN is also a Cauchy sequence and converges in 
operator norm to some linear operator U on V\. It is easily 
checked that U must be isometric. □ 

Lemma A.3 (Norm Inequalities): Let H be a finite- 
dimensional Hilbert space, and let \ip),\ip) G H with 
\<p)\\ = 1. Then, 



- 1*1 



Tr 



< 



Moreover, if p,a G T*(H) are density operators, then 

||P -Oil < H/0-O-llxr ■ 

Proof. Let A := |^)(^| - \ f)(ip\. Using flTSl 9.99], 

||A||| r = 1 - |<^)| 2 = (1- |(^|^)|) (1 + |(^|^)|) 



< 2-2\{ip\(p)\<2-2Re{ip\<p) 

= (^-^|V-^ = III^)-|^)I| 2 ■ 

Let now A := p — a, then A is Hermitian. We may assume 
that one of its eigenvalues which has largest absolut value is 
positive (otherwise interchange p and a), thus 



|A| 



max (u|A|u) 
I l«)ll=i 



max Tr(PA) 

P pi"0j., TrP=l 



< 



max Tr(PA) = ||A||Tr 
p proj. 



according to iTTSl 9.22]. 



□ 



Lemma A.4 (Dimension Bound for Similar Subspaces): 
Let H be a finite-dimensional Hilbert space, and let V, W C H 
be subspaces such that for every \v) £ V with || \v)\\ = 1 
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there is a vector \w) S W with || = 1 which sat- 

isfies || \v) - \w)\\ < e, where < e < 4(dim V_ 1)a is 
fixed. Then, dimVF > diml^. Moreover, if additionally 

, i /c\2— 2 dim V , , , 

£ — M J holds, then there exists an isometry 

U:V^W such that \\U - 1\\ <§V^(|) dimV 
Proof. Let {\vx), . .. , \vd)} be an orthonormal basis of V. By 
assumption, there are normalized vectors {|u>i), . . . , \iVd)} C 
W with || \vi) — \wi)\\ < e for every i. From the definition 
of the trace distance for pure states (see ifTHl (9.99)] together 
with Lemma IA.3I it follows for every i ^ j 



1 



\(Wi\Wj)\ 2 = || \Wi){Wi\ - \Wj){wj\ \\Tr 

> || \Vi){Vi\ ||Tr 
-|| \Vi){Vi\ - \Wi)(Wi\ \\tt 
-II \Vj){Vj\ - \wj)(Wj\ || T r 

> i - || K)- K)IMIh-> -K-)|| 

> 1 - 2e . 

Thus, | (tOi | tOj) | < 2^ < and it follows from 

Lemma IA.1I that dim W > d. Now apply the Gram-Schmidt 
orthonormalization procedure to the vectors {\wi)}f =1 : 
fc-i 

|e fc ) := \w k ) - 2_,{wk\ei)\ei) , |e fe ) : 



Use ||||g fc > 
calculate 

II \e k ) 



< II \e k ) 



|efc/ll 
w k )\\ and 



Iw/c) 



fc-i 

E 



< 



< 



»=i 
fc-i 

E 

i=l 
fc-1 



\(w k \ei - Wi}\ + \(w k \wi 



i>ll +2v/e 



Let A i 



|tOfc)|| for every 1 < fc < d. We will 
now show by induction that < 2^ f (f ) — 1 ■ This 
is trivially true for k = 1, since Ai = 0. Suppose it is true 
for every 1 < i < k — 1, then in particular, Aj < | by the 
assumptions on e given in the statement of this lemma, and 




Now define the linear operator U : V — > via linear exten- 
sion of t/|t>i) := |ej) for 1 < i < d. This map is an isometry, 
since it maps an orthonormal basis onto an orthonormal basis 
of same dimension. By substituting \v) — ^2k=i a k\ v k) an d 
using e < 4^ and the geometric series, it easily follows that 



U\v) 



if 



= 1. 



□ 



7^0, 



Lemma A.5 (Stability of the Control State): 
If |V>, b») G H n and || |^)|| = || 1^)11 = 1 and \v) 
then it holds for every QTM M and every t E No 

|<g/|A&(MM)l?/> - (g/l^ClvX^Dk/)! 

\(qf\Mh(\v}(v\)\ qf )-( qf \MU\v Q )(v \)\ qf )\ 

<|i-iib>ii 2 l ■ 

Proof. Using the Cauchy-Schwarz inequality, Lemma |A31 and 
the contractivity of quantum operations with respect to the 
trace distance (cf. ifTHl (9.35)]), we get the chain of inequalities 



< 
< 
< 



|Af&(|V)M) 
|M^(|^(^|)-M^(M^|)| 

|l^)(vi-b)(^iL ■ 



The second inequality can be proved by an analogous calcu- 
lation. □ 
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